Challenges in Automated Deception Detection in Computer-Mediated Communication

نویسندگان

  • Victoria L. Rubin
  • Niall J. Conroy
چکیده

Deception detection remains novel, challenging, and important in natural language processing, machine learning, and the broader LIS community. Computational tools capable of alerting users to potentially deceptive content in computer-mediated messages are invaluable for supporting undisrupted, computer-mediated communication, information seeking, credibility assessment and decision making. The goal of this ongoing research is to inform creation of such automated capabilities. In this study we elicit a sample of 90 computer-mediated personal stories with varying levels of deception. Each story has 10 associated human judgments, confidence scores, and explanations. In total, 990 unique respondents participated in the study. Three analytical approaches are applied: human judgment accuracy, linguistic cue detection, and machine learning. Comparable to previous research results, human judges achieve 50–63% success rates. Actual deception levels negatively correlate with their confident judgments as being deceptive (r=-0.35, df=88, p=0.008). The best-performing machine learning algorithms reach 65% accuracy. Linguistic cues are extracted, calculated, and modeled with logistic regression, but are found not to be significant predictors of deception level or confidence score. We address the associated challenges with error analysis of the respondents’ stories, and prose a faceted deception classification (theme, centrality, realism, essence, distancing) as well as a typology for stated perceived cues for deception detection (world knowledge, logical contradiction, linguistic evidence, and intuitive sense).

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Study on Deception Detection Based on Classification for Chinese Text

Deception detection on Chinese text is vital to the safety of people's life, the survival of enterprises and the stability of the country. The expansion of the Internet has significantly increased the amount of textual communication received and stored by individuals and organizations. Inundated with massive amounts of textual information transmitted through Computer-mediated Communication (CMC...

متن کامل

The Effects of Group Member Experience and Task Complexity on Computer Mediated Collaborative Groups Facing Deception

Individuals often work together in environments where computers mediate their communication. One key area affected by computer mediated communication in group settings is the transmission of cues to deception. If cues to deception are filtered, such as they are with computer-mediated communication, deception detection accuracy can be hindered due to the lack of cues available, and group task pe...

متن کامل

Guess Who? An Empirical Study of Gender Deception and Detection in Computer-Mediated Communication

The verification of an online conversation partner’s identity is a challenge due to the lack of verbal and visual cues in computer-mediated communication. People must constantly assess the identity of whomever they are communicating with based on limited interaction. This poster describes an empirical study that identifies how people attribute gender and detects gender deception in online text-...

متن کامل

0 - 7695 An Exploratory Study into Deception Detection in Text - based Computer - Mediated

Deception is an everyday occurrence across all communication media. The expansion of the Internet has significantly increased the amount of textual communication received and stored by individuals and organizations. Inundated with massive amounts of textual information transmitted through Computer-mediated Communication, CMC, people remain largely unsuccessful and inefficient in detecting those...

متن کامل

Survey on Perception of People Regarding Utilization of Computer Science & Information Technology in Manipulation of Big Data, Disease Detection & Drug Discovery

this research explores the manipulation of biomedical big data and diseases detection using automated computing mechanisms. As efficient and cost effective way to discover disease and drug is important for a society so computer aided automated system is a must. This paper aims to understand the importance of computer aided automated system among the people. The analysis result from collected da...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011